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Abstract 

Recently Drmota and Gittenberger (1997) proved a conjecture due 
to Aldous (1991) on the height profile of a Galton- Watson tree with 
an offspring distribution of finite variance, conditioned on a total size 
of n individuals. The conjecture states that in distribution its shape, 
more precisely its scaled height profile coincides asymptotically with 
the local time process of a Brownian excursion of duration 1. We give 
a proof of the result, which extends to the case of an infinite variance 
offspring distribution. This requires a different strategy, since in the 
infinite variance case there is no longer a relationship to the local time 
of Brownian resp. Levy excursions. 
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1 Introduction and main result 

In this paper we analyse the shape of a Galton- Watson tree, conditioned to 
have a total number of n individuals. More precisely we study its asymptotic 
height profile, as n — > oo. 

By a tree t we mean a rooted, ordered, finite tree. We consider the 
vertices to represent individuals, such that t can be regarded as the family 
tree of the progeny of some founding ancestor (the root). It is assumed 
that among siblings there is an order (of birth), which allows to imbed 
such trees into the plane. The total number of vertices of t, its size is 
denoted by s(t). We shall be interested in the way, in which the size of the 
different generations vary along the tree. An individual i belongs to the fe'th 
generation, if the path from i to the root contains exactly k edges. Let 
denote the number of individuals in generation k = 0, 1, The sequence 
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1 = zo, z\, Z2, ■ ■ • is called the height profile of t (possibly after a suitable 
renormalisation) . 

For a Galton- Watson tree T the number of children of the different in- 
dividuals are assumed to be independent and identically distributed ran- 
dom variables. p x , x = 0, 1, . . . denotes the probability, that an individual 
possesses x children, and is the number of individuals in generation k. 
We shall consider T, conditioned on a size s(T) = n. Such a conditioned 
Galton- Watson tree will be abbreviated as CGW(n)-tree. Since it contains 
n individuals, it is natural to consider its height profile 

H u = a n 1 Z H«J' u - °> 

rescaled with some sequence (a n ) of positive numbers. The aim is to choose 
(a n ) such, that H n converges in distribution. Then a n gives the magnitude 
of the breadth of the CGW(n)-tree, whereas n/a n indicates the order of 
h(T) = max{/c | > 0}, the height of the CGW(n)-tree. - We assume: 

Assumption A 

1. ) The offspring distribution (p x ) x has mean 1: 

oo 

^^xp x = 1. 

2=0 

The greatest common divisor of all x with p x > is 1. 

2. ) There are positive numbers a n such that a~ 1 (^i + • • • + £n — n) converges 
in distribution to a non- degenerate limit law v, as n — )• oo. Here £i,£2 ; • • • 
denote independent random variables with distribution (p x )x- 

The case of an offspring distribution with a finite mean can be treated in 
much the same way and is not a genuine generalisation (see Kennedy |25j). 
Similar the assumption on the g.c.d. may be removed. Criticality and a 
g.c.d. equal to 1 are assumed just for convenience. 

The second assumption has been completely analysed. It has several 
implications, which are discussed in chapter XVII in Feller |16j and in the 
monograph of Gnedenko and Kolmogorov [18]. In particular (p x ) is in the 
domain of attraction of a stable law. Its index a belongs to (1,2], since we 
deal with an offspring distribution of finite mean. This implies a n = o(n). 
The limit law v = v a is determined by a up to a scaling constant. For a = 2 
this is simply the normal law, and for 1 < a < 2 it is a one-sided stable law, 
because the negative tail of the offspring distribution is zero. 

We shall prove, that under these assumptions the height profile H n con- 
verges in distribution, where the a n are just the numbers, given in the as- 
sumption. The limiting process H = (H u ) u turns out to be a functional 
of certain normalized excursions: H = ip(Y). By a normalized excursion 
we understand a process Y = (Xs)o<s<i> such that Yq = Y% = and 
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inf ,5<s<i— <5 Y s > for all 5 > 0. H will be obtained from Y as follows: 
The corresponding cumulative height profile 

f-U 

C u = H v dv, u > 0, 
Jo 

is given by 

f dt 

C u = supjs < 1 : / — <u\. 

Jo y t 

H, having a.s. paths continuous from the right, is completely determined 
by the cumulative height process C. If f 0+ dt/Y t < oo, then h > and 
H u > for all u E (0, h), also C u — > 1, as u — > /i. Then we say, that H is 
non- degenerate. The case J Q , dt/Yt = °°> implying C = and il = 0, is in 
the sequel of no significance. - The same transformation already appears in 
the paper |29j by Lamperti, who used it to transform Levy-processes into 
continuous branching processes. Our theorem is in a way a conditioned 
version of Lamperti's result. 

The normalized excursions are obtained as follows. There is a unique 
Levy-process X = (X s ) s >q (a process with independent and stationary in- 
crements), such that Xq = and the law v is the distribution of X\, and 
to each of these processes there belongs a normalized excursion Y. This is 
explained in detail in Bertoin's monograph [5 J (in particular chapter VIII. 4). 

Theorem 1 Let H n be the scaled height profile of a CGW(n)-tree, satisfy- 
ing assumption A. Then, as n — > oo, H n converges in distribution to the pro- 
cess H = ip(Y), derived from the corresponding normalized Levy-excursion 
Y. H is a.s. non- degenerate. 

In general the excursions contain jumps. These jumps also show up in 
H. Thus we regard H n and H as random elements in the space of cadlag- 
functions, endowed with the usual Skorohod J\ topology (compare [14"]). 

A neat case is that of an offspring distribution with finite variance: 

oo 

a 2 = x2 Px — 1 < oo. 

x=0 

Then, choosing a n = cm 1 / 2 , Y is a normalized Brownian excursion. In 
this situation the limiting process allows another appealing description: 

(H u ) u = (t}L u / 2 )u, where L denotes the local time process of a normal- 
ized Brownian excursion. The reason is: Up to a factor 1/2 L is related to a 
normalized Brownian excursion in the same way, as H is derived above from 
Y, which follows from a result of Jeulin [19] (compare Biane [6], Theoreme 
3). This version of Theorem 1 has been observed by Aldous in special cases 
(as the geometric offspring distributions) and conjectured in the general fi- 
nite variance case, compare [3]. A first proof of the conjecture was given 
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by Drmota and Gittenberger [11]. They mastered the formidable task to 
obtain convergence of the finite-dimensional distributions as well as tight- 
ness, using generating functions and thereby generalizing work of Kennedy 
|25j on the one-dimensional distributions. Pitman [33] surrounded the diffi- 
culties by imbedding the problem into the context of convergence of strong 
solutions of stochastic differential equations. The limiting distribution of 
the maximum of the height profile, the 'width' of the family tree, has been 
found in the finite variance case already by Takacs [39]. 

The relations to stochastic analysis can be further developed in the case 
a 1 < oo. Note that the defining equation for C can be written as 

f Cu dt 

u = — for < u < h, and C u = 1 for u > h, 
Jo Y t 



where 

r 1 dt 

h 



o Y t 

h is the asymptotic height of the rescaled tree, as will become clear in the 
next section. By differentiation with respect to u we get 

H u = ^ = Y(C U ). 
au 

Now a Brownian excursion Y solves the stochastic equation dY = dW + 
(1 — jzt)yi with a standart Brownian motion W (compare [36], chapter IV, 
section (40.4)). Viewing C as a time-change, this leads to the stochastic 
equation 

2 



/ tt2 \ 

dH = VWdB + ( i - ) dt ' 



with a standart Brownian motion B. Pitman [34J also obtains this equation 
and discusses it in detail, therefore there is no need to trace this aspect 
further. The idea of using C as a time-change goes back to Lamperti [29] . 

The mentioned proofs all focus on the relationship to Brownian local 
time. In contrast we shall not rely on local times in this paper. (A short 
description of our proof in the finite variance case already appeared in the 
technical report [26].) The reason is that in the case a < 2 the connection to 
local times breaks down. Then Y and consequently H exhibits a.s. jumps. 
Since local time processes of Levy-processes (if existent) have a.s. continuous 
paths (compare f.e. [5], chapter V.l), they are no longer suited. 

This can be explained on a heuristic level, too. Consider the following 
construction, going back to Harris [23] and used by different people. Tra- 
verse the individuals of T in the following manner: From individual i pass 
over to its oldest child, which has not yet been visited, resp. return to 
its predecessor, if all children of i have already been visited. This gives a 
traversal through T with the root as starting and end point. Each edge is 
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passed twice, once forward and once backward. Next consider the associated 
random path, which increases one unit, if we change over to a child, and 
decreases one unit, if we go back to a predecessor. In the case of a geometric 
offspring distribution we get a true random walk excursion, conditioned to 
return to zero after 2n steps for a CGW(n)-tree. This is due to the lack of 
memory of the geometric distribution. The number of upcrossings from level 
A; — 1 to level k is equal to the size of the fc'th generation, which makes the 
relation to local times obvious in this situation. In general the random path 
exhibits complicated dependence properties. In the case a 2 < oo they are of 
a local nature and vanish in the limit n — > oo, as was shown by Aldous, such 
that the asymptotic height profile can still be described by Brownian local 
time. For a < 2 however, the dependence structure survives in the limit. 

The combinatorics of CGW(n)-tvees have been widely studied by means 
of generating functions (see f.e. [HI [171 [251 EI] ) • Probabilistic methods have 
been introduced in Kolchin [28] and in particular by Aldous [21 [3]. Our proof 
of theorem 1 is based on two probabilistic constructions, which are valid for 
the infinite variance case, too. The first one will be described in section 2, it 
establishes a connection between Galton- Watson trees and suitable random 
walk excursions. Though the relationship has been known for quite a while 
(compare [10J ) , the scope of this approach has been enlarged considerably 
only recently (see [H [20] ) . In our context it allows to reduce convergence 
of H n to convergence of excursions. The required continuity theorem will be 
developed in section 3. The second probabilistic construction, which will be 
presented in section 4, is size-biasing of Galton- Watson trees. We use it to 
describe the bottom of the trees, which is of some interest of its own and will 
help to check a main condition of the continuity theorem. This concept goes 
back to Geiger |21| . who developed a construction due to Lyons, Pemantle 
and Peres [30J. Section 5 addresses the question of convergence of excursions. 

Thus the height profile will be considered as a functional of a random 
walk excursion S. Other quantities of the tree can be viewed as well as 
functionals of S. In this manner we can also treat the height profile of 
random forests, as discussed by Drmota and Gittenberger [12] and Pitman 
|34j in the finite variance case. We contend ourselves by stating a version of 
the theorem, which is valid for infinite variances, too. A conditioned random 
forest consists of I Galton- Watson trees, conditioned to contain altogether n 
individuals. Let now Z^ be the total number of all individuals in generation 
k in one of the I trees. Suppose that I ~ ja n , as n — >• oo, with 7 > 0. 
Then the height profile (H™) u = (a~ Z\ nu / an ]) u converges in distribution. 
The limiting process can be described as follows. Let X 7 = (X 7 , s )o< s <i be 
a Levy-process as above, now conditioned to hit —7 at the moment s = 1 
for the first time. It is built up from the excursions 

Y V = ( Y ri,s)s<L v = {X s + rj)T n <s<T v +, < 7] < 7, 

with T v = inf{s : X s = —n}, T r]+ = inf{s : X s < —n} and L v = T ri+ — T r] . 
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Then the limiting process is given by 



XI H v,u, u>0, 

0<r?<7 

where H^^ is derived similarly as above from 

f s dt 

C VtU = sup{s < L v : / — < u}. 

Jo i v ,t 

The sum is a.s. finite for every u > 0, which reflects the fact, that also in the 
limit only finitely many trees contribute to the height profile. This result 
can be proved in much the same manner (and with only little additional 
effort), as we shall obtain Theorem 1 below. For 7 = we are back in the 
situation of Theorem 1. 



2 Trees and Random Walk Excursions 

It has been known for some time, that Galton- Watson trees can be imbedded 
into random walks (compare [8] and the references therein) . This is implicit 
in Dwass' important paper |10| . Here we give a combinatorical treatment, 
which allows generalization. Let T be a tree of size n. Suppose that we 
label the individuals in T with the numbers 1,2, ... ,n such that the root 
gets label 1, the individuals in the first generation the labels 2, . . . , Zq + Z\ 
(say from left to right), the individuals in the second generation the labels 
Zq + Z\ + 1, . . . , Zq + Z\ + Z2 and so forth. Using these labels we define a 
random path S = (S(0), S(l), . . . , S(n)) recursively by 

5(0) = 1, S(i) = S(i - 1) + & - 1, % = 1, . . . n, 

where £j denotes the offspring number of the individual with label i. We can 
imagine that the path arises as follows: To its i'th increment individual i 
contributes the downward step —1, whereas each of its children contributes 
one step +1 upwards. Then each individual is responsible for one upward 
and one downward step, except the root, which has no predecessor and thus 
contributes only a step downwards. Since S(0) = 1, 

S(n) = 0. 

Furthermore, individuals always have smaller labels than their offspring, 
therefore the upward step of an individual appears before its downward 
step. Clearly this implies 

S(i) > for all i < n, 

i.e. S is an excursion of length n. Conversely, given such an excursion S of 
length n, with S(i) > S(i — 1) — 1, we can construct a tree, fitting to the 
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excursion S. Namely, from S we read off & = S(i) — 5(i — 1) + 1, and the 
given labelling rule allows us to grow the tree from its root. Thus there is 
a one-to-one correspondence between trees of size n and excursion of length 
n. 

As to the probabilistic aspect of the construction note, that for a Galton- 
Watson tree the £j are independent random variables, such that S becomes 
an ordinary random walk excursion. Likewise S is a random walk excursion 
of duration n, if T is a CGW(n)-tiee. 

Remarks 1.) These considerations remain valid for other ways of 
labelling. One possibility is to label according to depth-first search, which 
has been exploited in [4, 20J. In general the following properties are required: 

i) 1 is the label of the root. 

ii) The individuals with labels 1, . . . , i form a subtree for any i < n. In other 
words: Any individual has a smaller label than any of its children. 

iii) Given the subtree with labels 1, . . . ,i and the numbers £i, . . . , there is 
a rule, which specifies, which child of the individuals 1, ... ,i gets the label 
i + l. 

2.) A random forest of I trees and n individuals can be described by a 
random walk path S with 5(0) = i, S(i) > for i < n and S(n) = 0. If we 
label the trees one after the other, then the r'th tree is represented by the 
part of the random walk between the hitting times of I + 1 — r and I — r. 

In our labelling 1, . . . , Zq + Z\ + ... + Zk-i are just the members of 
generation to k — 1. They contribute a negative step to S(Zq + Z\ + . . . + 
Zk-i). Their children, i.e the individuals in generation 1 to k, add a positive 
step. Therefore 

Zk = S(Z + Z\ . . . + Zk-i). 

This is the announced random walk representation for the height profile, 
which has been used by several authors (see 0). We transform it into a 
differential equation. Define 

u 

a n Z[nv/a„] d-V, 

in particular 

C ka n /n = ~(Zo + ... + Z k _!). 

Further let for < s < 1 

S n (s) = a-^CH) 

and 

Y n (s) = S (C£ an / n ) for s e [Cjfc an / n )C(fe + i) an / n )- 
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Then the above equation translates into the ordinary differential equation 



which by integration leads to 
'' c ™ dt 



o 



Y n (t) 



for u <h n and C" = 1 for u > h r , 



with 

, dt 

h. 



Y n {t)' 

h n obviously is the height of the rescaled tree. - It is now our plan to 
reduce the question of convergence of H n and C n to that of S n and Y n . 
The next section provides the required continuity statement. 



3 A Continuity Theorem 

Let D and D' be the spaces of all cadlag- functions / : [0, 1] — > M resp. 
g : [0, oo) —> M. By endowing them with the common Skorohod-distance 
(compare [13]) we make them to complete metric spaces. Thus functions f n 
converge to / in D, if there are increasing bijections a n : [0, 1] — > [0, 1], such 
that, as n — > oo, 

sup \a n (t) - t\ -4- 0, sup \f n (t) - f(a n (t))\ -)■ 0. 

t t 

Similarly g n — > g in D' , if there are increasing bijections f3 n : [0, oo) — > [0, oo), 
such that for all v < oo 

sup \P n {u) - u\ -)■ 0, sup \g n {u) - g(/3 n (u))\ -4 0. 

If g is a continuous function, we may choose /3 n (u) = u. 

We begin by collecting some analytical facts. Let D + be the space of 
non- negative / G D. 

Lemma 2 Let f, f n € D + . 

i) For given s the mapping f h-> J q s dt/f(t) (with the possible value oo) is 
lower- semicontinuous (and thus measurable). 

ii) If inf a < 4 <;, f(t) > for some < a < b < 1, and if f n — > f , then 

JaW)~ Ja fn(t) ^ ' 



sup 

a<s<b 
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Proof Let a n as above. As is well-known, it may be assumed without 
loss of generality, that they are differentiable functions such that 

sup \a n (t) — l\ — > 0. 
t 



i) If f n — > f, then for S > by Fatou's Lemma 

dt 



L 



s~S ^ />s-<5 
< liminf 



< liminf 



/(*) ~ n J Q UaVMt)) ~ n 

and consequently 



a n (t) dt 



dt 



< liminf 



dt 



lo f^) "" n "" Jo fn(t)' 
ii) Due to uniform convergence of the integrands 



sup 

a<s<b 



r dt r - 

J a fnit) J a /(«n 

Further, substituting a n (t) = w 

dt dw 



it)) 



0. 



dw 

L fia n (t)) J a a' n (t)f(w) ' J a f(w) 
uniformly for all s £ [a,b]. 

Define now for / € D + a function g = </>(/) € D' by 



g(u) = sup{s < 1 | J < u}. 



Thus as above 



u 



-L 



o fit) 



with 



for u < h, and g(u) = 1 for u > h, 
4 dt 



h = I 

Jo 



fitY 



□ 



g is continuous and increasing. It is everywhere differentiable from the right 
(since / is continuous from the right), and the derivative is given by 

d + d+ 

—giu) = f(g(u)) for u < h and —g{u) = for u > h. 

ii/Uj XJjUj 



We denote 
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Lemma 3 Suppose /«—>/*« D + , sup u \<f>{f n )(u) — 4>{f)(u)\ — > and 
/o* < 00 /or aZi s < 1. Then V>(/„) -»• V(/) »« D' . 

Proof Denote 5 = <£(/), g n = 4>{f n ) and h n = /J dt/f n (t). Let a n (t) 
be as above. The required bijections j3 n : M+ — > M + are defined as 



Then 



9 1 {a n {9n{u))) for u < h n , 
(3 n (h n ) + (u - h n ) for u > h n . 



sup |#(/3 n (u)) - g n (u)\ = sup |a n (g„(«)) - g n (u) 

u<h n u<-h n 



< sup \a n (t) — t\ — > 0. 



t 



By assumption it follows 

sup \g{p n {u)) - g(u)\ -»• 0. 

If /t = Jq 1 dt/f(t) < 00, then g -1 (s) = J S dt/f(t) is uniformly continuous on 
[0,1], and it follows 

sup \f3 n (u) — u\ = sup \/3 n (u) — u\ — > 0. 

« u<h n 

If on the other hand h = 00, then /i n — >■ 00 and g -1 is uniformly continuous 
on every intervall [0, s] with s < 1. In this case we may conclude 

sup \/3 n (u) — u\ — > 

for every u > 0. This is one of the desired properties. 

Next for u > h n we have 1 = g n (h n ) = g n {u) and 1 = a n (l) = 
a n (gn(h n )) = g(Pn{h n )) = g{Pn(u)), therefore 

sup\f n (g n (u)) - f(g((3 n (u)))\ 

u 

= sup \ fn(g n (u)) - f(g(p n (u)))\ 

U<h n 

= sup \fn(g n (u)) - f(a n (g n (u)))\ 

u<h n 

= sup\f n (t)-f(a n (t))\^0. 
t 

Since ip(f) = f o g and ip(f n ) = f n o also 

sup|V(/n)H-^(/)(^(n))|^0, 

u 

which proves the claim. □ 
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We are now ready to prove the main result of the section. Let Y, Y n be 
stochastic processes with paths in D + , and define processes C, C n , H and 
H n by 

C u = <P(Y)(u), CZ = 0(Y n )(u), 
H u = ^(Y)(u), HZ = i>(Y n )(u). 

Theorem 4 Assume inis< s <isYs > a.s. for all 5 > and, as e — > 0, 

limsupP n (C™ < e) -> 

n 

for all u > 0. Then C u > a.s. for all u > 0, i.e. H is non- degenerate, and 
C n — > C , H n —7- H in distribution, osn->oo. 



Proof Due to a wellknown theorem of Skorokhod (see [35], chapter 
IV. 3, Theorem 13) we may assume that the processes Y n and Y are defined 
on a single probability space (0, A, P), and that Y n — > Y a.s. in D + . Because 
of semicontinuity (Lemma 2 i)) and the definition of C it follows C u > 
limsup n C^ a.s.. From Fatou's Lemma 

P(C U < e) < P(limsupC£ < e) < limsupP(C™ < e). 



Thus our assumptions imply C u > a.s. for all u > 0. 



Next let C u < 1. Then C£ < 1 for large n, therefore n = J^ u dt/Y t 



r o °« dt/Y t n . It follows 

|C?-C„| < maxY r 



= max Y n 

Therefore for any a < 1 and b such that = a 

sup|C£-C u | < sup|C-C| + (l-C 6 n ) + (l-C b ) 

u u<b 



dt 


r Cu dt 


\rn 
I t 


Jo 


dt 


r Cu dt 


Y t ~ \ 


L YT 


= a 





< 2sup|C^-C u |+2(l 

u<b 



< 2maxy n sup 

s<a 



dt 
Y 



dt 
YT 



+ 2(1 -a). 



Now max Y n — > max Y a.s.. In view of Lemma 2 ii) it follows for every rj > 
there is a v > (chose a sufficiently close to 1), such that for all e > 

limsup P(sup|C™ — C u \ > 7]) 



< rj + limsup P( 



dt 

Y 



dt 



> v) 



< n + V(C v < e) + limsup P(C£ < e) 

n 

< n + 2 limsup P(C™ < e). 
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By assumption the righthand side can be made arbitrarily small, such that 
sup u \ C U — C™| —7- in probability. In view of Lemma 3 H n — > H in proba- 
bility, and the proof is finished. □ 



4 Size-biased Galton— Watson Trees 

In this section we verify one of the conditions of Theorem 4 for height profiles 
of a CGW(n)-tree. We shall make use of size-biased trees, as they have 
been constructed by Geiger for binary branching trees resp. splitting trees 
(compare [21]). Let 

oo 
x=0 

be the generating function of the offspring distribution (p x ). Fix A £ (0, 1) 
and consider the probability weights 

q x = \ x p x /(f(\). 

Then we have for a Galton Watson tree T the formula 

Q(T = t) = A s «-V(A)- S(t) ]Jp d{l) = A'W-VCA)-® P(T = t), 

i i 

where d(i) denotes the number of children of individual i in the (nonrandom) 
tree t, and P and Q denote the probability measures, corresponding to the 
offspring distributions (p x ) resp. (q x )- It follows the known fact (compare 
[2"5] ) that two CGW(n)-tvee with offspring distributions (p x ) and (q x ) are 
equal in distribution. Passing over to (q x ) has the effect that the total size 
of T becomes finite in mean ([15] . chapter XII. 5): 

E QS (T) = (1 - /i)- 1 < oo, 

where 

fi = y]xq x <1. 

X 

Thus (differently from P) we may bias Q by introducing the probability 
measure 

Q(T = t) = (1 - n)s(t) Q(T = t). 

Note: Conditioning on size s(T) = n, it makes no difference, whether we 
consider Q or Q. If moreover M is chosen purely at random from the vertices 
of T, i.e. from {1,2,..., s(T)}, then 

Q(T = t, M = m) = (1 - n)Q(T = t). 
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Let (q x ) be the distribution obtained from (q x ) by size-biasing, i.e. 

q x = xq x /fJL, 
then we may rewrite the above formula as 

Q(T = t,M = m) = (1 - y)n 9 Y[ ^W^T II 

i<m j^m 

where g denotes the generation of individual m, and i < m means that 
individual i is a predecessor of m. Following Geiger |21j . this formula gives 
raise to a probabilistic construction for the size-biased Galton Watson tree: 

Construction of the tree T 

• Let G be a random variable with geometric distribution and parameter 
fi. Choose independent random variables £1, • • • ,£g with distribution 
(q x ). Let Cj be random numbers, taken independently and uniformly 
from {1,2,. ..,§}, j = l,...,G. 

• The trunk of T: Build up a line of G consecutive individuals, where 
the j'th individual has £j children. Let the first one be the tree's root 
and the (j + l)'th one be the Cj'th child (in the order of birth) of 
the j'th individual. Give the label M to the Cc'th child of the G'th 
individual. 



• The tree top of T: Besides these G individuals the trunk contains 
y = £i + ... + £G + l — G additional individuals. They propagate in 
the usual Galton- Watson manner. This means: In order to complete 
the tree T we attach independent Galton- Watson trees T±, . . . , T y with 
offspring distribution (q x ) to the trunk. 

In this manner we obviously obtain a size-biased tree together with an 
individual M, taken at random from the tree, and belonging to generation 
G: 

Q(f = t,M = m) = Q(T = t, M = m). 

Note that for this construction it is only required that (p x ) has mean 1. In 
the limiting case A = 1 (or equivalently fi = 1) G takes a.s. the value oo, 
and q x is equal to 

Px = Xp x . 

Then we get an infinite size-biased tree T, which already appeared in the 
work of Grimmett [22], Kesten [27], Aldous [2J and others. We use this tree 
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to describe the asymptotic shape of the lower part of a CGW(n)-tiee. For 
any tree t let t(k) be the tree, which results by cutting off all individuals in 
t belonging to a generation greater than k. Thus, t{k) has in generation k' 
the generation sizes zj~' for k! < k, and for k' > k. Further let T(k) be the 
set of trees with height at most k. 

Theorem 5 Let T be a Galton-Watson tree with an offspring distribution 
(Px) fulfilling assumption A, and let (k n ) be a sequence of natural numbers 
such that k n = o{n/a n ). Then for n — > oo 

sup \P(T(k n ) e B | s(T) = n) - P(f(k n ) G B)\ -> 0. 
BcT{k n ) 

In the special case of a Poisson offspring distribution this was proved by 
Aldous in [2] . We prepare the proof by gathering some facts of an analytical 
character. Let 

y<x 

In order that (p x ) belongs to the domain of attraction of a stable law with 
index a, it is necessary and sufficient, that v(x) varies regularly at infinity 
with exponent 2 — a. Then a n can be chosen as any sequence with the 
property 

a 2 n /v(a n ) = cn(l + o(l)) 

with some c > (compare |16| . chapter XVII. 5). It follows that a n varies 
regularly with exponent 1/a. We have a local limit law at our disposal, 
which in our case reads as follows. 

Proposition 6 Let £i,£2>--- be i.i.d. random variables with common dis- 
tribution (p x ), satisfying assumption A. Then uniformly in x G Z 

P(6 + • • • + " n = x) = a-yx/a n )(l + o(l)), 

where g denotes the (continuous and strictly positive) density of the limit 
law v. 

The proof can be found in [18 , chapter 9, up to an exceptional case, and 
in [33] in full generality. 

Lemma 7 Let £i, £2; • • • be independent random variables with distribution 
(p x )- Then assumption A implies convergence o/a" 1 ^! + ... + C[n/a n }) i- n 
distribution. 

Proof If v (00) = o" 2 < 00, then a n is asymptotically proportional to 
n 1 / 2 and (p x ) has finite mean, and the claim follows from the ordinary law 
of large numbers. The case a = 2,v(oo) = 00 is similar. Then v(x) = 
J2y< x (y ~ ^)Py ^ s a s l ow ly varying function. It follows a;P(£ > x) = o(v(x)), 
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as x — > oo (compare |16j . chapter VIII. 9, Theorem 2). This allows to apply 
a generalized law of large numbers, as given f.e. in |13j . chapter 1.5, exercise 
5.11. 

If a < 2, then x^2 y>x p y ~ (2 — a)(a — l)^ 1 v(x), as follows from the 

cited theorem in [16J. Therefore P(£ > x) is regular varying with exponent 
1 — a, such that £ belongs in distribution to the domain of attraction of 
a positive stable law. Moreover P(£ > a n ) ~ ca n /n for some c > 0, from 
which the claim follows by standard results on convergence in distribution 
to stable laws. □ 



Lemma 8 Let T, T\,T2,... be independent Galton- Watson trees with off- 
spring distribution (j>k)> satisfying assumption A. Then it follows: 
i) P(s(T) = n)= n- 1 a~ 1 5(0)(l + o(l)), as n -> oo. 
mJ n (s(Ti) + . . . + s(TLi)) converges in distribution. 

Proof i) is an immediate consequence of Proposition 6 and the following 
classical formula (see Dwass [TU] and Kolchin [2S], Lemma 2.1.3): 

P(s(T) =n) = -P(6 + . . . + Cn = n - 1). 

n 

Since a n is regularly varying with exponent 1/a, it follows P(s(T) > n) ~ 
(1 — a _1 )<7(0)a,^ 1 . Consequently s(T) belongs in distribution to the domain 
of attraction of a positive stable law, and ii) follows from standard results. 
□ 

We come to the Proof of Theorem 5. Together with T we consider an 
individual M, chosen at random from T. We already pointed out that 

P(T = t, M = m | s(T) = n) = Q(T = t, M = m | s(T) = n). 

Therefore instead of T we may consider the size-biased tree T, as obtained 
in the above construction. Let x\, . . . ,X\~ be offspring numbers of the lower 
k individuals in the trunk (including the root). Let the (j + l)'th individual 
in the trunk be the dj'th child of the j'th individual, with dj < Xj. Let 
t\, . . . , t z , z = x\ + . . . + Xk — k be the trees, growing out of these offspring to 
the right and left of the trunk. Because of the lack of memory of a geometric 
distribution 

Q(?i = xi,-- - = x k , 

Ci = di, . . . Xk = d k ,Tt = ii, . . . ,T Z = t z ,s(T) = n) 
= Q(G > k) q Xl . . . q Xk x^ X . . . x^ 1 

x Q(Tx = ti) . . . Q(T Z = t z ) Q(s(f) = n -k-s) 
= fj, k q Xl . . . qx^^ 1 ...x^ 1 Q(Ti = h) . . . Q(T Z = t z ) 

x (1 - fi)(n — k — s)Q{s{T) = n-k-s), 
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with s = s(t\) + . . . + s(t z ). As to the dependence on A it is easy to check, 
that the probabilities on the righthand side contain altogether the factor 
yT k \ n ~ l <p(\) n ■ Therefore there are numbers c(n, A) such that 

Q(fi = xi, . . . ,f fc = x k , 

Ci = di,... ,(k = d k ,Tx = ti,...,T z = t z ,s(T) = n) 
= c(n, A) p xi . ..p Xk x^ 1 ...x^ 1 P(Ti = ti) . . .P(T 2 = t z ) 
x(n-k- s)P(s(T) =n-k-s), 

in particular 

Q(s(f ) = n) = c(n, A) n P(s(T) = n). 
From Lemma 8 i) we get 

P(£i =a;i,... ,£ k = x k , 

Cl = di, . . . , Cfe = d k ,Tx = h, . . . ,T Z = t z | s(T) = n) 
~ p Xl ...p Xk Xi 1 ...x- 1 P(T 1 = t 1 )...P(T z = t z ), 

as long as k + s(^i) + . . . + s(t 2 ) = o(n). Now it is assumed that k = k n = 
o(n/a n ). From Lemma 7 it follows, that in probability z = ■ -+t;k — k n = 
o(a n ). Therefore in view of Lemma 8 s(Ti) + . . . + s(T z ) = o(n). Thus our 
claim follows. □ 

We use now Theorem 5 to verify the condition of Theorem 4 for a height 
process. 

Lemma 9 Let C n be the cumulative height process of a CGW {n)-tree, as 
defined in section 2. If assumption A is satisfied, then as e — > 0, 

limsupP(C^ < e) -> 

n 

for all u > 0. 

For the proof we need an estimate on the height of a Galton- Watson 
tree. 

Lemma 10 Let h(T) = maxjfc | Z k > 0} be the height of a Galton-Watson 
tree. Then assumption A implies P(h(T) > n/a n ) = c'a~ 1 (l + o(l)) for 
some d > 0. 

Proof Since v(x) is regularly varying, by Karamata's Tauberian Theo- 
rem (compare [16j, chapter XIII. 5) varies regularly at 1- with exponent 
a — 2, more precisely 

^"(A) = J>(* - l)X x ~ 2 p x ~ T(3 - a) v((l - A)- 1 ). 

X 
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It follows that y?'(A) — 1 and (f(\) — A vary regularly at 1-, too (compare 
PU, chapter VIII.9), and 

^(A)-A~ — ^— (l-A)V(A). 
a{a — 1) 

This allows us to apply Lemma 2 from Slack [37], which in our notation says 

<p(P(h(T) < re)) - P(h{T) < re) 1 

P(h(T) > n) (a - l)n' 

Combining these estimates and replacing n by n/a n we get 
P(h(T) > n/an) v(P(h(T) > n/a n y l ) 



T(3 — a) n 

Comparing this with v(a n )/a n ~ ca n /n, our claim follows. □ 

Proof of Lemma 9 We have to show that there are sufficiently many 
individuals in the bottom of the trees under consideration. This will be 
done first for the infinite size-biased tree T. Let T±,...,T Z be those Galton- 
Watson trees, which grow out of the trunk of T at an individual, belonging 
to a generation less than n/a n . Then, given n > 0, in view of Lemma 7 
there are numbers ci,C2, such that c\a n < z < C2a n with probability at 
least 1 — rj/3. In view of Lemma 10 the number of trees Ti,i < C2a n with 
h(Ti) > n/a n has asymptotically a Poisson distribution. Therefore there is 
a number I, such that 

limsupP(/i(Tj) > n/a n for at least I of the % < z) < rj/2. 

n 

On the other hand because of Lemma 8 the number of trees Ti,i < c\a n 
such that s(Tj) > Sn is asymptotically Poisson distributed, with a parameter 
going to oo, as 5 goes to 0. Therefore there is a 5 > such that 

limsupP(s(Tj) > 5n for less than I of the i < z) < rj/2. 

n 

Altogether we conclude: For any rj > there is a 5 > 0, such that with 
probability at least 1 — 77 there is a tree Tj,i < z, such that h(Ti) < n/a n 
and s(Tj) > Sn. Since Tj stems from an individual in a generation less than 
n/a 

n: w(3 see, that for large n the number of individuals in the first 2n/a, n 
generations of T is bigger than Sn with probability at least 1 — n. 

Now let e n > be a sequence, converging to 0. Choose S > and define 
m = m(n) by e n n = Sm. Then from the definition of C n in section 2 for 
u > 0, if n is large enough, 

P(C£<e n ) = P(Z +Z1 + ... + Z [nu/an] < Sm I s(T) = n) 
< P(Z + + + Z [2m/am] < Sm I s{T) = n). 
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Since m = o(n), we may in view of Theorem 5 switch over from T to T, 
therefore 

limsupP(C£ <£„)<?? 

n 

for all r] > 0. This holds for any sequence e n , thus, as is not difficult to see, 
the claim of the lemma follows. □ 



5 Convergence of Excursions 

In this section we prove convergence in distribution of the rescaled random 
walk S, conditioned on the event S(n) = 0, S(i) > for i < n. The usual 
way to define a random walk excursion is, to condition the random walk on 
the event S(n) < 0, S(i) > for i < n. Since we deal with a random walk, 
skipfree to the left (i.e. steps to the left cannot be bigger than 1), this makes 
no difference. Convergence of normalized random walk excursions seems to 
be studied only in the finite variance case, see Kaigh [23]. In this section we 
derive the required generalization. Let as in section 2 

S n (s) = a^Sdns}), < s < 1. 

Theorem 11 Under assumption A the processes S n , conditioned on the 
event S(n) = 0, S(i) > fori < n, converge in distribution to the normalized 
excursion Y of the corresponding Levy-process. It holds Yq = Y\ = and 
inls< s <isY s > a.s. for all 5 > 0. 

Before proving this result let us first complete the proof of our main 
theorem. 

Proof of Theorem 1 Recall the notation introduced in section 2. In 
view of Theorem 4 and Lemma 9 it remains to show that also the processes 
Y n converge in distribution. From the random walk representation of section 
2 it follows 

sup|C(fc + i )on/n - Cl anjn \ = ^supZfc < ^ sup S n {t). 

Because of Theorem 11 the righthand supremum converges in distribution. 
Since a n = o(n), we obtain 

sup \t n k+l - t n k \=o{l) 

k 

in probability, with i£ = C^ a Without loss we may again assume that 
this convergence as well as the convergence of S n to Y takes place in the 
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a.s. sense. This means, that there are a.s. functions a n : [0, 1] —> [0, 1] such 
that 

sup \S n (t) - Y(a n {t))\ -> 0, sup \a n {t) - t\ 0. 
t t 

The role of the a n is, as is well-known, to match the jump points si, S2, ■ ■ ■ 
of Y to those jumps of S n , which asymptotically are not negligible. These 
are given by a n (sj). Define j3 n (sj) = t£ +1 , if a n (sj) £ [*5bJ*5b+i) and j < d n . 
If d n is going sufficiently slowly to oo, we obtain by linear interpolation 
bijections (3 n : [0, 1] — > [0, 1], which match the jump points of Y to those of 
Y n . This implies, as is not difficult to see, that a.s. 

sup \Y n (t) - Y((3 n {t))\ -> 0, sup \/3 n {t) - t\ -> 0. 
t t 

Thus y™ —7- Y a.s. in the Skorohod sense, and our claim follows. □ 

The plan of the proof of Theorem 11 is, to reduce the theorem to con- 
vergence of random walks and random walk bridges. The first step, namely 
to generalize Donsker's theorem to the infinite variance situation, is fairly 
obvious. It is probabely known, though we could not find it in the literature 
(compare however Bloznelis [Jj and the papers cited therein). 

Proposition 12 Under assumption A the unconditioned processes S n con- 
verge in distribution to the Levy-process X fulfilling X\ = v in distribution. 

Proof Since S n and X have independent, stationary increments, con- 
vergence of the finite-dimensional distributions follows immediately from 

assumption A. Further |S n (r n + n ) - <S n (r„)| = \S n (9 n )\ -»• in probability 
for any sequence of positive numbers n > 0, going to 0, and any sequence 
r n of stopping-times, bounded uniformly from above. Now tightness follows 
from a criterion due to Aldous pQ. □ 

Next we discuss convergence of random walk bridges. For the construc- 
tion of Levy-bridges we refer the reader to [5], chapter VIII. 3. Random walk 
bridges are treated in the next proof in quite a similar spirit. 

Proposition 13 Under assumption A the processes S n , conditioned on the 
events S(n) = 0, converge in distribution to the Levy-process X, conditioned 
on the event X = 0. 

Proof Fix t G (0, 1) and let k : D — > R be a continuous functional, 
such that n(f), f E D does only depend on the values of f(s), s < t. Then 
by the Markov property 

E(«(S") \S(n) = 0) = E(K(S n )h n (S([tn}))) 
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with h n {x) = P(5(n) - S([tn]) = -x)/P(S(n) = 0). By Proposition 6 

, , , 9(-x/a n -[tn]) /g(P) M , 1V( 
h n (x) = J (l + o(l)) 

a n— [tn] a n 

= (1 - tr 1/a 9 (-*/a n _ [tn] )M0) (1 + o(l)) 
uniformly in x. Therefore 

EWS ,.) | S(n) = 0) = gwsw^il (1 + o(1)) . 

Proposition 12 implies 

E(K(X)g(-(l-t)-^X(t))) 



E( K (5 n ) \S(n) = 0) 



(l-t)V^(O) 



This proves convergence in distribution of the process (S n (s)) s <t, condi- 
tioned on the event S(n) = 0, for any < t < 1. Finally we have the 

duality relation (5 n (l — s)) s = (S n (0) — S n (s—)) s , thus also convergence of 
the processes (S n (s)) s >i-t follows for every < t < 1. Combining these 
results our claim follows. □ 

The proof also shows that for any t < 1 the distribution of the process 
(X s ) s <t, given X\ = 0, is absolute continuous with respect to the distribu- 
tion of the unconditioned process {X s ) s <t (which follows as well from the 
construction in [5]). This allows to transfer properties. We need the fol- 
lowing one: The Levy-process X attains on [0, 1] its minimal value a.s. at 
exactly one point, namely 

T = inf{s < 1 | X t > min(X s ,X s „) for all t > s}. 

Also T < 1 a.s., and X is a.s. continuous at T. This follows from Proposi- 
tions 2.1 to 2.4 in Millar [32j (for our process is regular for both (— oo,0) 
and (0, oo)). By absolute continuity this carries over to the process (X s ) s <i, 
conditioned on X\ = 0. 

There is an easy recipe using cyclic permutation of a path, which allows 
to pass over from a bridge 1 = 5(0), 5(1), . . . ,S(n — l),5(n) = to an 
excursion 1 = 5(0), 5(1) > 0, . . . ,5(n — 1) > 0,5(n) = 0, and which has 
been utilized by different people (see f.e. [38J). Let 

T n = min{i < n \ S(j) > S(i) for all j = i, i + 1, . . . , n} 

be the moment, when 5 takes its first minimal value before time n. Then 
5, given by 



5(i) 



S(T n +%)- S(T n ) + 1, if i < n - T n 
5(T n + i-n) - S{T n ), if i>n-T n , 
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is an excursion for any bridge S. In this manner we associate to each excur- 
sion n + 1 different bridges, one being the given excursion itself. 
This motivates to associate for any / 6 D an / £ D, given by 

7m = / /(t + T(/)) " /(T(/)) + /(0) ' if * < 1 " T(f) 
J{) \ f(t + T(f) - 1) - /(T(/)) + /(l), if t > 1 - T(f), 

where 

T{f) = min{s < 1 | f(t) > min(/(s),/(s-)) for all t > s}. 
Thus 7(0) = /(0), 7(1) = /(!)• 

Lemma 14 Suppose, that f £ D attains its minimum at no other point 
than T(f). Then f n — >■ / implies f n —>fin the space D. 

Proof f n — > f means that there are bijections a n , such that 

sup \f n (t) - f{a n (t))\ -)• 0, sup \a n {t) - t\ ->• 0. 

t t 

From the uniqueness of the minimum of / it follows 

T(f n )^T(f). 
Now let 7/„ > be numbers going to zero and define 



a n (t + T(f n )) - T(f), if Tj n < t < 1 - T(/ n ) 

«n(t + T(/ n ) - 1) + 1 - T(f), if 1 - T(f n ) <t<l- Vn . 



Also put /3 n (0) = 0, (3 n (l) = 1 and continue (3 n by linear interpolation on 
the whole interval [0, 1]. If rj n goes to zero slowly enough, (3 n is a bijection 
of [0, 1], and 

sup\/3 n (t)-t\ 

Further 

SUp|/ n (t)-7G8n(*))l 

t 

< 2 sup \ f n (t) - f(a n (t))\ 

t 

+ 2 sup \fn(s)-f(a n (t))\ 

T(U)-Vn<s,t<T(f n )+r, n 

< 4sup|/ n (t)-/(o n (*))| 

+ 2 sup |/(a n (a))-/K(*))|. 

7 1 (/„)-r ?n <s,t<T(/„)+r ? „ 

If T(/) is a point of continuity of /, then the righthand terms all go to zero, 
and it follows the claim f n — > f. The case that / has a jump at T(f) (which 
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will not be considered in the sequel) is treated similarly; then T(f n ) = T(f), 
for n large enough. □ 
Proof of Theorem 11 Proposition 13 shows that the rescaled random 
walk bridges S n converge in distribution to a Levy-bridge X. Due to Skoro- 
hod's theorem we may assume without loss of generality, that S n — > X a.s. 
in the Skorohod topology. Since X attains its minimum a.s. at a unique 
point, which is a point of continuity, it follows from Lemma 14 S n — > X a.s.. 
By construction Y = X has the stated property. □ 

This method of deriving a the normalized excursion Y from the bridge 
X, is well-known for Brownian motion, and in the general case discussed in 
detail in Chaumont [9]. 
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